Exploring Differential Topic Models for Comparative Summarization of Scientific Papers
نویسندگان
چکیده
This paper investigates differential topic models (dTM) for summarizing the differences among document groups. Starting from a simple probabilistic generative model, we propose dTM-SAGE that explicitly models the deviations on group-specific word distributions to indicate how words are used differentially across different document groups from a background word distribution. It is more effective to capture unique characteristics for comparing document groups. To generate dTM-based comparative summaries, we propose two sentence scoring methods for measuring the sentence discriminative capacity. Experimental results on scientific papers dataset show that our dTM-based comparative summarization methods significantly outperform the generic baselines and the state-of-the-art comparative summarization methods under ROUGE metrics.
منابع مشابه
SciSumm: A Multi-Document Summarization System for Scientific Articles
In this demo, we present SciSumm, an interactive multi-document summarization system for scientific articles. The document collection to be summarized is a list of papers cited together within the same source article, otherwise known as a co-citation. At the heart of the approach is a topic based clustering of fragments extracted from each article based on queries generated from the context sur...
متن کاملTowards Multi-Document Summarization of Scientific Articles:Making Interesting Comparisons with SciSumm
We present a novel unsupervised approach to the problem of multi-document summarization of scientific articles, in which the document collection is a list of papers cited together within the same source article, otherwise known as a co-citation. At the heart of the approach is a topic based clustering of fragments extracted from each co-cited article and relevance ranking using a query generate...
متن کاملAutomatic Generation of Related Work Sections in Scientific Papers: An Optimization Approach
In this paper, we investigate a challenging task of automatic related work generation. Given multiple reference papers as input, the task aims to generate a related work section for a target paper. The generated related work section can be used as a draft for the author to complete his or her final related work section. We propose our Automatic Related Work Generation system called ARWG to addr...
متن کاملSustainable Supply Chain Network Design: A Review on Quantitative Models Using Content Analysis
The purpose of this paper is to develop a systematic literature review on the subject of sustainable supply chain network design during 1990-2016, through a review of 261 papers. In this study, qualitative technique for conducting a systematic literature review was used. To systematize and make the literature review more accurate, content analysis method was used that include data collect...
متن کاملTopic Modeling and Classification of Cyberspace Papers Using Text Mining
The global cyberspace networks provide individuals with platforms to can interact, exchange ideas, share information, provide social support, conduct business, create artistic media, play games, engage in political discussions, and many more. The term cyberspace has become a conventional means to describe anything associated with the Internet and the diverse Internet culture. In fact, cyberspac...
متن کامل